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DETAILED ACTION 



Claim Rejections - 35 USC §102 
1. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form 
the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in 

(1) an apphcation for patent, published under section 122(b), by another filed in the United States 
before the invention by the applicant for patent or 

(2) a patent granted on an application for patent by another filed in the United States before the 
invention by the applicant for patent, except that an international apphcation filed under the treaty 
defined in section 35 1(a) shall have the effects for purposes of this subsection of an application filed in 
the United States only if the international application designated the United States and was published 
under Article 2 1 (2) of such treaty in the English language. 



2. Claim 1-2, 6, and 16 is rejected under 35 U.S.C. 102(e) as being anticipated by 
Yourlo (U.S. Patent No. 6,201,176.) 

As to claim 1, Yourlo teaches a method for determining similarity between a 
plurality of musical works (see Abstract, and see column 1, lines 5-10) comprising 
the steps of: 

obtaining respective digitized audio files of the plurality of musical works (see 
column 1, lines 37-47, and see column 7, lines 64-66); 

for each musical work in the plurality, forming (i) a spectral representation from 
the corresponding audio file (see column 2, lines 29-32, and see column 7, Unes 9-21) 
and (ii) a rhythmic beat representation from the corresponding audio file (see column 
1, lines 43-47, and see column 5, lines 37-42); 
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for a given musical work of interest: 

(a) comparing its spectral representation to the spectral representations of the 
musical works in the plurality (see column 2, lines 3-9, and see column 2, lines 29- 
43); 

(b) comparing its rhythmic beat representation to the rhythmic beat 
representations of the musical works in the plurality (see column 6, lines 30-38); and 

(c) summing, including respective weighting of results of the comparisons in (a) 
and (b), (see column 5, lines 58-65, and see column 6, lines 49-53) the summed 
results providing an indication of which musical works in the plurality are similar to 
the given musical work of interest (see column 10, lines 16-57, and see column 11, 
lines 10-26.) 



As to claims 2 and 16, Yourlo teaches wherein forming a spectral representation 
includes dividing the corresponding audio file into a plurality of fi-ames (see column 
5, line 66 through column 6, line 5.) 

As to claim 6, Yourlo teaches further comprising performing a windowing 
function on each fi-ame (see column 3, lines 20-28.) 



3. 



Claim Rejections - 35 USC §103 
The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 
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(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 



4. Claims 3-5, 7-13, 17-32, 35-36, and 39-48 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Yourlo (U.S. Patent No. 6,201, 176) in view of Hoory et al 
(U.S. Patent No. 6,678,655.) 

As to claims 3 and 17, Yourlo does not teach converting each frame to a spectral 
representation to obtain a plurality of spectral representations for the audio file. 

Hoory et al teaches a method for speech recognition (see Abstract), in which he 
teaches converting each frame to a spectral representation to obtain a plurality of 
spectral representations for the audio file (see column 2, lines 5-20.) 

Therefore, it would have been obvious to a person having ordinary skill in the art 
at the time the invention was made to have modified Yourlo to include converting 
each fi-ame to a spectral representation to obtain a plurality of spectral representations 
for the audio file. 

It would have been obvious to a person having ordinary skill in the art at the time 
the invention was made to have modified Yourlo by the teaching of Hoory et al . 
because converting each frame to a spectral representation to obtain a plurality of 
spectral representations for the audio file, would enable the system to assign weights 
to a plurality of audio file features, for comparing and retrieval of an audio file best 
matching the desired audio features. 
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As to claim 4, Yourlo as modified, teaches wherein the spectral representation 
includes a vector of Mel-frequency cepstral coefficients (see Hoory et al> column 2, 
lines 55-62, and see column 4, lines 39-42.) 



As to claims 5 and 18, Yourlo as modified, teaches wherein each spectral 
representation includes a plurality of Mel-frequency cepstral coefficients (see Hoory 
et al, column 5, lines 10-14.) 

As to clahn 7, Yourlo does not teach applying a Hamming window on each frame. 

Hoory et al teaches applying a Hamming window on each frame (see column 4, 
lines 45-54, and see column 5, lines 35-39.) 

Therefore, it would have been obvious to a person having ordinary skill in the art 
at the time the invention was made to have modified Yourlo to include applying a 
Hamming window on each frame. 

It would have been obvious to a person having ordinary skill in the art at the time 
the invention was made to have modified Yourlo by the teaching of Hoory et al , 
because applying a Hamming window on each frame, would result in the line 
spectrum, corresponding to each base function sampled at the pitch frequency 
multiples, to be converted to a DFT spectrum (see Hoory et al, column 5, lines 35- 
38.) 
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As to claim 8, Yourlo does not teach applying a pre-emphasis on each frame. 
Hoory et al teaches applying a pre-emphasis on each frame (see column 4, lines 
45-50.) 

Thereft)re, it would have been obvious to a person having ordinary skill in the art 
at the time the invention was made to have modified Yourlo to include applying a 
pre-emphasis on each frame. 

It would have been obvious to a person having ordinary skill in the art at the time 
the invention was made to have modified Yourlo by the teaching of Hoory et al , 
because applying a pre-emphasis on each frame, would pre-emphasize the input 
speech signal, which is then fed to the windowing for Ham transformation (see Hoory 
et al> column 4, lines 46-50.) 

As to claim 9, Yourlo as modified, teaches subjecting data from each frame to a 
Fast Fourier Transform fiinction to obtain a frequency domain signal for each frame 
(see Hoory et al, column 4, lines 50-54.) 

As to claim 10, Yourlo as modified, teaches warping a log amplitude of each 
frequency signal to a Mel-frequency scale (see Hoory et al, column 4, lines 54-66.) 

As to claim 11, Yourlo as modified, teaches subjecting the warped frequency 
fimction to a second Fast Fourier Transform to obtain a parameter set of 
Mel-frequency cepstral coefficients (see Hoory et al column 5, Imes 35-64.) 
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As to claim 12, Yourlo as modified, teaches subjecting the frequency domain 
signal for each frame to a set of triangular filters to obtain a pluraUty of 
Mel-frequency spaced components (see Hoorv et al > column 4, lines 50-54, and see 
column 5, line 40 through column 6, line 9,) 



As to claim 13, Yourlo as modified, teaches subjecting the Mel-frequency spaced 
components to a discrete cosine transform function to obtain a plurality of Mel- 
frequency cepstral coefficients (see Hoorv et al . column 5, lines 15-20.) 



As to claim 19, Yourlo as modified, teaches computing a similarity matrix for the 
audio file (see Yourlo, column 4, Unes 6-7.) 



As to claims 20 and 21, Yourlo as modified, teaches computing a beat 
spectrogram for the audio file (see Yourlo. column 12, lines 1-9.) 



As to claim 22, Yourlo as modified, teaches normalizing the histogram to account 
for the total number of frames of the audio file 9see Yourlo , column 8, line 58 
through column 9, line 9.) 

As to claim 23, Yourlo as modified, teaches calculating a distance between a pair 
of histograms (see Yourlo . column 13, lines 16-37.) 
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As to claim 24, Yourlo as modified, teaches wherein calculating the distance 
includes calculatmg the closest distance between the pair of histograms (see Yourlo , 
column 14, lines 28-33.) 

As to claim 25, Yourlo as modified, teaches wherein the closest distance is the 
minimum of the sum of absolute differences between bins of each histogram 
calculated over a range of scalings of each histogram (see Yourlo. column 14, lines 
34-47.) 

As to claim 26, Yourlo as modified, teaches applying a function to each histogram 
to weight certain bins (see Yourlo, column 6, lines 49-53.) 

As to claim 27, Yourlo as modified, teaches scaling each histogram at least twice 
to allow for slight differences between musical works (see Yourlo, column 1 1, lines 



As to claim 28, Yourlo as modified, teaches wherein for each scale factor, one 
histogram is resampled by a factor and compared to the unsealed histogram (see 
Yourlo, column 6, lines 14-20.) 



36-67.) 



As to claim 29, Yourlo as modified, teaches generating a set of similar musical 
works (see Yourlo, column 10, Une 58 through column 1 1, line 9.) 
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As to claims 30, 41 and 44, Yourlo as modified, teaches visually displaying the 
musical works in a manner illustrating relative similarities or dissimilarities (see 
Yourlo, column 12, lines 33-44.) 



As to claim 31, Yourlo as modified, teaches calculating a relative distance 
between each pair of musical works (see Yourlo . Abstract, and see column 3, lines 
19-38.) 

As to claim 32, Yourlo as modified, teaches constracting a matrix of song 
similarity based on the relative distance (see Yourlo . column 2, lines 51-57.) 

As to claims 35 and 36, the appUcant is directed to the remarks and discussions 
made in claim 1 above. 



As to claim 39, Yourlo as modified teaches further comprising providing a 
respective reliability measure associated with each representation (see Yourlo. 
column 8, line 63 through column 9, Une 9.) 



As to claim 40, Yourlo as modified, teaches wherein the step of summing 
includes weighting resuhs of the comparisons as a function of reliability measures of 
the representations compared (see Yourlo . column 9, line 10 through column 1 1, line 
57.) 
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As to claim 42, the applicant is directed to the remarks and discussions made in 
claim 1 above. 

As to claim 43, Yourlo as modified, teaches weighting the summed results (see 
Yourlo, column 6, hnes 49-63,) 

As to claim 45, Yourlo teaches a computer program product for determining 
similarity between a plurality of musical works, the computer program product 
including a computer usable medium having computer readable code thereon (see 
column 15, lines 28-42), including program code which: 

For the remaining steps of this claim, the applicant is directed to the remarks and 
discussions made in claim 1 above. 

As to claim 46, Yourlo teaches a computer data signal embodied in a carrier wave 
for (see column 5, line 66 through column 6, line 14) determining sunilarity between 
a pluraUty of musical works (see Abstract), comprising program code (see column 22, 
Imes 14-28.): 

For the remaining steps of this claim, the applicant is directed to the remarks and 
discussions made in claim 1 above. 



As to claim 47, Yourlo teaches a computer system (see column 15, lines 28-55) 
comprising: 

a processor (see column 15, Une 66 through column 16, line 1; 
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a memory system connected to the processor (see column 16, lines 1-2); and 
a computer program, in the memory (see column 20, lines 29-32) which 

determines similarity between a plurality of musical works. 

For the remaining steps of this claim, the applicant is directed to the remarks and 

discussions made in claim 1 above. 

As to claim 48, Yourlo teaches a system for determining similarity between a 
plurality of musical works, (see Abstract) the system comprising: 

For the remaining steps of this claim, the appUcant is directed to the remarks and 
discussions made in claim 1 above. 



5. Claims 14-15, and 33-34 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Yourlo (U.S. Patent No. 6,201,176) in view of Hoorv et al (U.S. Patent No. 
6,678,655), as applied to claims 3-5, 7-13, 17-32, 35-36, and 39-48 above, and further 
in view of Walker et al (U.S. Patent No. 6,710,822.) 

As to claim 14, Yourlo as modified, still does not teach clustering the spectral 
representations of the audio file to obtain a spectral signature for the audio file. 

Walker et al teaches signal processing method (see Abstract), in which he teaches 
clustering the spectral representations of the audio file to obtain a spectral signature 
for the audio file (see column 5, lines 16-35, and see column 9, lines 42-56.) 

Therefore, it would have been obvious to a person having ordinary skill in the art 
at the time the invention was made to have modified Yourlo as modified, to include 
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clustering the spectral representations of the audio file to obtain a spectral signature 
for the audio file. 

It would have been obvious to a person having ordinary skill in the art at the time 
the invention was made to have modified Yourlo as modified, by the teaching of 
Walker et ai, because clustering the spectral representations of the audio file to obtain 
a spectral signature for the audio file, would enable the system to distinguish voice 
segments (see Walker et al, column 5, lines 26-28.) 

As to claim 15, Yourlo as modified, teaches comparing the spectral signatures of 
two different audio files using an Earth Mover's Distance (see Walker et al , column 
12, lines 1-45.) 



As to claim 33, Yourlo as modified still does not teach performing a Multi- 
dimensional scaling on the matrix to obtain coordinates in K-dimensional space for 
each musical work, one coordinate per song. 

Walker et al teaches performing a Multi-dimensional scaling on the matrix to 
obtain coordinates in K-dimensional space for each musical work, one coordinate per 
song (see column 1 1, line 41 through column 12, Une 7.) 

Therefore, it would have been obvious to a person having ordinary skill in the art 
at the time the invention was made to have modified Yourlo as modified, to include 
performing a Muhi-dimensional scaling on the matrix to obtain coordinates in 
K-dimensional space for each musical work, one coordinate per song. 
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It would have been obvious to a person having ordinary skill in the art at the time 
the invention was made to have modified Yourlo as modified, by the teaching of 
Walker et al. because performing a Multi-dimensional scaling on the matrix to obtain 
coordinates in K-dimensional space for each musical work, one coordinate per song, 
would enable the system to measure differences between various features of various 
songs. 

As to claim 34, Yourlo as modified teaches plotting the coordinates (see Yourlo, 
column 9, lines 13-15.) 



6. Claims 37 and 38 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Yourlo (U.S. Patent No. 6,20L176) in view of Hoorv et al (U.S. Patent No. 
6,678,655), as applied to claims 3-5, 7-13, 17-32, 35-36, and 39-48 above, and fiirther 
in view of Bloom et al (U.S. Patent No. 4,591,928.) 

As to claim 37, Yourlo as modified still does not teach the step of preprocessing 
the audio files before formmg the different representations for each musical work. 

Bloom et al teaches a method of processing audio signals (see Abstract), in which 
he teaches preprocessing the audio files before forming the different representations 
for each musical work (see column 22, lines 30-46, and see column 23, lines 22-60.) 

Therefore, it would have been obvious to a person having ordinary skill in the art 
at the time the invention was made to have modified Yourlo as modified, to include 
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preprocessing the audio files before forming the different representations for each 
musical work. 

It would have been obvious to a person having ordinary skill in the art at the time 
the invention was made to have modified Yourlo as modified, by the teaching of 
Bloom et al, because preprocessing the audio files before forming the different 
representations for each musical work, would allow the system to filter out any noise 
in the audio file prior to using the file as mput to the comparison unit for similarities 
between musical works. 

As to claim 38, Yourlo as modified, teaches wherein the step of preprocessing 
mcludes omitting relatively long pauses (see Bloom et al> column (see column 22, 
lines 30-46, and see column 23, lines 22-60.) 



7. Any inquiries concerning this communication or earUer communications fi-om the 
examiner should be directed to Tony Mahmoudi whose telephone number is (703) 
305-4887. The examiner can normally be reached on Mondays-Fridays from 08:00 
am to 04:30 pm. 

If attempts to reach the examiner by telephone are unsuccessfiil, the examiner's 
supervisor, Dov Popovici, can be reached at (703) 305-3830, 



Conclusion 



June 25, 2004 
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